Overview

Dataset statistics

Number of variables28
Number of observations26969
Missing cells32
Missing cells (%)< 0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.8 MiB
Average record size in memory224.0 B

Variable types

CAT16
NUM10
BOOL2

Reproduction

Analysis started2020-09-21 17:00:31.509054
Analysis finished2020-09-21 17:00:59.720593
Duration28.21 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

df_index has constant value "0" Constant
Date & Time has a high cardinality: 22997 distinct values High cardinality
Type of aircraft has a high cardinality: 1085 distinct values High cardinality
Operator has a high cardinality: 7837 distinct values High cardinality
Registration has a high cardinality: 25839 distinct values High cardinality
Schedule has a high cardinality: 14646 distinct values High cardinality
MSN has a high cardinality: 17759 distinct values High cardinality
Location has a high cardinality: 12845 distinct values High cardinality
Country has a high cardinality: 218 distinct values High cardinality
Circumstances has a high cardinality: 21207 distinct values High cardinality
FlightDate has a high cardinality: 17590 distinct values High cardinality
FlightOperator has a high cardinality: 7836 distinct values High cardinality
src_rowid is highly correlated with rowidHigh correlation
rowid is highly correlated with src_rowidHigh correlation
Total fatalities is highly correlated with Pax fatalitiesHigh correlation
Pax fatalities is highly correlated with Total fatalitiesHigh correlation
Crew on board is highly skewed (γ1 = 149.7973589) Skewed
Other fatalities is highly skewed (γ1 = 61.95186755) Skewed
Date & Time is uniformly distributed Uniform
Registration is uniformly distributed Uniform
FlightDate is uniformly distributed Uniform
rowid has unique values Unique
src_rowid has unique values Unique
dt has unique values Unique
Crew on board has 3480 (12.9%) zeros Zeros
Crew fatalities has 13596 (50.4%) zeros Zeros
Pax on board has 14091 (52.2%) zeros Zeros
Pax fatalities has 20401 (75.6%) zeros Zeros
Other fatalities has 26634 (98.8%) zeros Zeros
Total fatalities has 11108 (41.2%) zeros Zeros
PlaneAge has 1979 (7.3%) zeros Zeros

Variables

rowid
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct count26969
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13485.041232526233
Minimum1
Maximum26970
Zeros0
Zeros (%)0.0%
Memory size210.7 KiB

Quantile statistics

Minimum1
5-th percentile1349.4
Q16743
median13485
Q320227
95-th percentile25620.6
Maximum26970
Range26969
Interquartile range (IQR)13484

Descriptive statistics

Standard deviation7785.492517
Coefficient of variation (CV)0.5773428781
Kurtosis-1.199966116
Mean13485.04123
Median Absolute Deviation (MAD)6742
Skewness2.795850132e-05
Sum363678077
Variance60613893.73
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
20471< 0.1%
 
75291< 0.1%
 
33711< 0.1%
 
136121< 0.1%
 
156611< 0.1%
 
95181< 0.1%
 
115671< 0.1%
 
218241< 0.1%
 
238731< 0.1%
 
177301< 0.1%
 
Other values (26959)26959> 99.9%
 
ValueCountFrequency (%) 
11< 0.1%
 
21< 0.1%
 
31< 0.1%
 
41< 0.1%
 
51< 0.1%
 
ValueCountFrequency (%) 
269701< 0.1%
 
269691< 0.1%
 
269681< 0.1%
 
269671< 0.1%
 
269661< 0.1%
 

src_rowid
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct count26969
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13485.041232526233
Minimum1
Maximum26970
Zeros0
Zeros (%)0.0%
Memory size210.7 KiB

Quantile statistics

Minimum1
5-th percentile1349.4
Q16743
median13485
Q320227
95-th percentile25620.6
Maximum26970
Range26969
Interquartile range (IQR)13484

Descriptive statistics

Standard deviation7785.492517
Coefficient of variation (CV)0.5773428781
Kurtosis-1.199966116
Mean13485.04123
Median Absolute Deviation (MAD)6742
Skewness2.795850132e-05
Sum363678077
Variance60613893.73
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
20471< 0.1%
 
75291< 0.1%
 
33711< 0.1%
 
136121< 0.1%
 
156611< 0.1%
 
95181< 0.1%
 
115671< 0.1%
 
218241< 0.1%
 
238731< 0.1%
 
177301< 0.1%
 
Other values (26959)26959> 99.9%
 
ValueCountFrequency (%) 
11< 0.1%
 
21< 0.1%
 
31< 0.1%
 
41< 0.1%
 
51< 0.1%
 
ValueCountFrequency (%) 
269701< 0.1%
 
269691< 0.1%
 
269681< 0.1%
 
269671< 0.1%
 
269661< 0.1%
 

dt
Categorical

UNIQUE

Distinct count26969
Unique (%)100.0%
Missing0
Missing (%)0.0%
Memory size210.7 KiB
2020-09-20 20:45:25.443000
 
1
2020-09-20 18:37:07.430000
 
1
2020-09-20 15:16:08.957000
 
1
2020-09-20 17:10:10.123000
 
1
2020-09-20 22:00:30.400000
 
1
Other values (26964)
26964
ValueCountFrequency (%) 
2020-09-20 20:45:25.4430001< 0.1%
 
2020-09-20 18:37:07.4300001< 0.1%
 
2020-09-20 15:16:08.9570001< 0.1%
 
2020-09-20 17:10:10.1230001< 0.1%
 
2020-09-20 22:00:30.4000001< 0.1%
 
2020-09-20 16:43:52.0200001< 0.1%
 
2020-09-20 19:34:38.7470001< 0.1%
 
2020-09-20 15:25:45.6300001< 0.1%
 
2020-09-20 19:12:37.9500001< 0.1%
 
2020-09-20 16:21:36.6570001< 0.1%
 
Other values (26959)26959> 99.9%
 

Length

Max length26
Median length26
Mean length25.97041047
Min length19

df_index
Boolean

CONSTANT
REJECTED

Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size210.7 KiB
0
26969
ValueCountFrequency (%) 
026969100.0%
 

Date & Time
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count22997
Unique (%)85.3%
Missing0
Missing (%)0.0%
Memory size210.7 KiB
Mar 24, 1945
 
15
Mar 6, 1944
 
11
Jun 6, 1944
 
10
Dec 18, 1939
 
10
Dec 31, 1935
 
9
Other values (22992)
26914
ValueCountFrequency (%) 
Mar 24, 1945150.1%
 
Mar 6, 194411< 0.1%
 
Jun 6, 194410< 0.1%
 
Dec 18, 193910< 0.1%
 
Dec 31, 19359< 0.1%
 
Sep 16, 19439< 0.1%
 
Mar 24, 19448< 0.1%
 
Apr 16, 19448< 0.1%
 
Jul 11, 19438< 0.1%
 
Jan 6, 19457< 0.1%
 
Other values (22987)2687499.6%
 

Length

Max length23
Median length22
Mean length17.28425229
Min length11

Type of aircraft
Categorical

HIGH CARDINALITY

Distinct count1085
Unique (%)4.0%
Missing0
Missing (%)0.0%
Memory size210.7 KiB
Douglas C-47 Skytrain (DC-3)
 
2164
PZL-Mielec AN-2
 
761
Curtiss C-46 Commando
 
634
Avro 652 Anson
 
532
Douglas DC-3
 
423
Other values (1080)
22455
ValueCountFrequency (%) 
Douglas C-47 Skytrain (DC-3)21648.0%
 
PZL-Mielec AN-27612.8%
 
Curtiss C-46 Commando6342.4%
 
Avro 652 Anson5322.0%
 
Douglas DC-34231.6%
 
De Havilland DH.60 Moth3571.3%
 
De Havilland DHC-2 Beaver3391.3%
 
Piper PA-31-350 Navajo Chieftain3331.2%
 
Britten-Norman Islander3281.2%
 
Lockheed C-130 Hercules3211.2%
 
Other values (1075)2077777.0%
 

Length

Max length43
Median length21
Mean length20.37713671
Min length4

Operator
Categorical

HIGH CARDINALITY

Distinct count7837
Unique (%)29.1%
Missing0
Missing (%)0.0%
Memory size210.7 KiB
Royal Air Force - RAF (31257)
 
2247
United States Air Force - USAF (since 1947) (31252)
 
1443
Aeroflot - Russian International Airlines (30808)
 
1358
United States Army Air Forces - USAAF (1941-1947) (39359)
 
1350
United States Navy - USN (37278)
 
651
Other values (7832)
19920
ValueCountFrequency (%) 
Royal Air Force - RAF (31257)22478.3%
 
United States Air Force - USAF (since 1947) (31252)14435.4%
 
Aeroflot - Russian International Airlines (30808)13585.0%
 
United States Army Air Forces - USAAF (1941-1947) (39359)13505.0%
 
United States Navy - USN (37278)6512.4%
 
Private American (31833)4031.5%
 
Royal Australian Air Force - RAAF (31800)2671.0%
 
French Air Force - Armée de l'Air (31256)1910.7%
 
Royal Canadian Air Force - RCAF (31234)1750.6%
 
Brazilian Air Force - Força Aérea Brasileira (31232)1640.6%
 
Other values (7827)1872069.4%
 

Length

Max length98
Median length32
Mean length36.07504913
Min length11

Registration
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count25839
Unique (%)95.8%
Missing0
Missing (%)0.0%
Memory size210.7 KiB
2
 
6
1
 
6
48
 
5
29
 
5
1H+?S
 
5
Other values (25834)
26942
ValueCountFrequency (%) 
26< 0.1%
 
16< 0.1%
 
485< 0.1%
 
295< 0.1%
 
1H+?S5< 0.1%
 
754< 0.1%
 
264< 0.1%
 
1044< 0.1%
 
74< 0.1%
 
404< 0.1%
 
Other values (25829)2692299.8%
 

Length

Max length15
Median length6
Mean length6.217101116
Min length1

Flight Phase
Categorical

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size210.7 KiB
Flight
10930
Landing (descent or approach)
9808
Takeoff (climb)
5898
Taxiing
 
229
Parking
 
104
ValueCountFrequency (%) 
Flight1093040.5%
 
Landing (descent or approach)980836.4%
 
Takeoff (climb)589821.9%
 
Taxiing2290.8%
 
Parking1040.4%
 

Length

Max length29
Median length15
Mean length16.34517409
Min length6

Flight Type
Categorical

Distinct count31
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size210.7 KiB
Scheduled Revenue Flight
6122
Military
4476
Training
3105
Cargo
2719
Private
1955
Other values (26)
8592
ValueCountFrequency (%) 
Scheduled Revenue Flight612222.7%
 
Military447616.6%
 
Training310511.5%
 
Cargo271910.1%
 
Private19557.2%
 
Charter/Taxi (Non Scheduled Revenue Flight)15945.9%
 
Executive9753.6%
 
Survey / Patrol / Reconnaissance9423.5%
 
Bombing6782.5%
 
Positioning6332.3%
 
Other values (21)377014.0%
 

Length

Max length43
Median length8
Mean length14.74233379
Min length4

Survivors
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size210.7 KiB
Yes
15275
No
11694
ValueCountFrequency (%) 
Yes1527556.6%
 
No1169443.4%
 

Site
Categorical

Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size210.7 KiB
Airport (less than 10 km from airport)
13136
Plain, Valley
6260
Lake, Sea, Ocean, River
3633
Mountains
3295
City
 
443
ValueCountFrequency (%) 
Airport (less than 10 km from airport)1313648.7%
 
Plain, Valley626023.2%
 
Lake, Sea, Ocean, River363313.5%
 
Mountains329512.2%
 
City4431.6%
 
Desert2020.7%
 

Length

Max length38
Median length23
Mean length25.8350699
Min length4

Schedule
Categorical

HIGH CARDINALITY

Distinct count14646
Unique (%)54.3%
Missing1
Missing (%)< 0.1%
Memory size210.7 KiB
Point Cook - Point Cook
 
49
Mildenhall - Mildenhall
 
47
Waddington - Waddington
 
47
Paris - Croydon
 
44
Scampton - Scampton
 
44
Other values (14641)
26737
ValueCountFrequency (%) 
Point Cook - Point Cook490.2%
 
Mildenhall - Mildenhall470.2%
 
Waddington - Waddington470.2%
 
Paris - Croydon440.2%
 
Scampton - Scampton440.2%
 
Coningsby - Coningsby430.2%
 
Silloth - Silloth430.2%
 
Moscow - Moscow350.1%
 
Swinderby - Swinderby340.1%
 
Kinloss - Kinloss320.1%
 
Other values (14636)2655098.4%
 

Length

Max length244
Median length20
Mean length22.81423115
Min length3

MSN
Categorical

HIGH CARDINALITY

Distinct count17759
Unique (%)65.8%
Missing0
Missing (%)0.0%
Memory size210.7 KiB
01
 
67
2
 
51
1
 
50
10
 
40
14
 
38
Other values (17754)
26723
ValueCountFrequency (%) 
01670.2%
 
2510.2%
 
1500.2%
 
10400.1%
 
14380.1%
 
3370.1%
 
15330.1%
 
6280.1%
 
25280.1%
 
4260.1%
 
Other values (17749)2657198.5%
 

Length

Max length17
Median length5
Mean length5.480588824
Min length1

YOM
Real number (ℝ≥0)

Distinct count104
Unique (%)0.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1955.4122511031185
Minimum1911
Maximum2018
Zeros0
Zeros (%)0.0%
Memory size210.7 KiB

Quantile statistics

Minimum1911
5-th percentile1928
Q11943
median1951
Q31970
95-th percentile1988
Maximum2018
Range107
Interquartile range (IQR)27

Descriptive statistics

Standard deviation19.0618194
Coefficient of variation (CV)0.009748235641
Kurtosis-0.4784724193
Mean1955.412251
Median Absolute Deviation (MAD)13
Skewness0.4118138292
Sum52735513
Variance363.3529588
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1944277810.3%
 
194315495.7%
 
194511494.3%
 
19418173.0%
 
19427112.6%
 
19285982.2%
 
19695702.1%
 
19685512.0%
 
19744681.7%
 
19404561.7%
 
Other values (94)1732264.2%
 
ValueCountFrequency (%) 
19111< 0.1%
 
19132< 0.1%
 
19161< 0.1%
 
1918460.2%
 
19191820.7%
 
ValueCountFrequency (%) 
20183< 0.1%
 
20173< 0.1%
 
20165< 0.1%
 
201513< 0.1%
 
20149< 0.1%
 

Location
Categorical

HIGH CARDINALITY

Distinct count12845
Unique (%)47.6%
Missing0
Missing (%)0.0%
Memory size210.7 KiB
Atlantic Ocean All World
 
138
Pacific Ocean All World
 
132
Russia All Russia
 
102
North Sea All World
 
62
Mediterranean Sea All World
 
48
Other values (12840)
26487
ValueCountFrequency (%) 
Atlantic Ocean All World1380.5%
 
Pacific Ocean All World1320.5%
 
Russia All Russia1020.4%
 
North Sea All World620.2%
 
Mediterranean Sea All World480.2%
 
Mexico All Mexico410.2%
 
Croydon Surrey380.1%
 
United Kingdom All United Kingdom370.1%
 
Fort Lauderdale-Hollywood Florida370.1%
 
Kunming Yunnan370.1%
 
Other values (12835)2629797.5%
 

Length

Max length92
Median length23
Mean length24.99080426
Min length7

Country
Categorical

HIGH CARDINALITY

Distinct count218
Unique (%)0.8%
Missing0
Missing (%)0.0%
Memory size210.7 KiB
United States of America
6337
United Kingdom
 
2167
Russia
 
1440
Canada
 
1314
France
 
811
Other values (213)
14900
ValueCountFrequency (%) 
United States of America633723.5%
 
United Kingdom21678.0%
 
Russia14405.3%
 
Canada13144.9%
 
France8113.0%
 
Brazil7172.7%
 
Australia5972.2%
 
World5972.2%
 
Germany5842.2%
 
Colombia5131.9%
 
Other values (208)1189244.1%
 

Length

Max length30
Median length8
Mean length11.93755794
Min length4

Region
Categorical

Distinct count9
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size210.7 KiB
North America
7666
Europe
6236
Asia
5497
South America
2456
Africa
1984
Other values (4)
3130
ValueCountFrequency (%) 
North America766628.4%
 
Europe623623.1%
 
Asia549720.4%
 
South America24569.1%
 
Africa19847.4%
 
Oceania12764.7%
 
Central America12004.4%
 
World5982.2%
 
Antarctica560.2%
 

Length

Max length15
Median length6
Mean length8.653491045
Min length4

Crew on board
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct count29
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.1318550928844227
Minimum0
Maximum1920
Zeros3480
Zeros (%)12.9%
Memory size210.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q34
95-th percentile9
Maximum1920
Range1920
Interquartile range (IQR)3

Descriptive statistics

Standard deviation12.03736754
Coefficient of variation (CV)3.843526339
Kurtosis23847.36687
Mean3.131855093
Median Absolute Deviation (MAD)1
Skewness149.7973589
Sum84463
Variance144.8982173
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1596222.1%
 
2549620.4%
 
0348012.9%
 
3310011.5%
 
426499.8%
 
518757.0%
 
613415.0%
 
79643.6%
 
86542.4%
 
93791.4%
 
Other values (19)10694.0%
 
ValueCountFrequency (%) 
0348012.9%
 
1596222.1%
 
2549620.4%
 
3310011.5%
 
426499.8%
 
ValueCountFrequency (%) 
19201< 0.1%
 
1071< 0.1%
 
481< 0.1%
 
371< 0.1%
 
251< 0.1%
 

Crew fatalities
Real number (ℝ≥0)

ZEROS

Distinct count25
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.6889391523601172
Minimum0
Maximum25
Zeros13596
Zeros (%)50.4%
Memory size210.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q33
95-th percentile7
Maximum25
Range25
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.545561619
Coefficient of variation (CV)1.507195576
Kurtosis5.704845242
Mean1.688939152
Median Absolute Deviation (MAD)0
Skewness2.115337858
Sum45549
Variance6.479883957
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
01359650.4%
 
1374313.9%
 
2282210.5%
 
318256.8%
 
415675.8%
 
510684.0%
 
66952.6%
 
75151.9%
 
83641.3%
 
92460.9%
 
Other values (15)5282.0%
 
ValueCountFrequency (%) 
01359650.4%
 
1374313.9%
 
2282210.5%
 
318256.8%
 
415675.8%
 
ValueCountFrequency (%) 
251< 0.1%
 
233< 0.1%
 
221< 0.1%
 
212< 0.1%
 
201< 0.1%
 

Pax on board
Real number (ℝ≥0)

ZEROS

Distinct count254
Unique (%)0.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8.988987355853016
Minimum0
Maximum509
Zeros14091
Zeros (%)52.2%
Memory size210.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q36
95-th percentile45
Maximum509
Range509
Interquartile range (IQR)6

Descriptive statistics

Standard deviation26.27983587
Coefficient of variation (CV)2.92355911
Kurtosis49.39493322
Mean8.988987356
Median Absolute Deviation (MAD)0
Skewness5.954956288
Sum242424
Variance690.6297734
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
01409152.2%
 
119037.1%
 
214365.3%
 
310744.0%
 
49193.4%
 
57792.9%
 
65562.1%
 
74901.8%
 
84531.7%
 
93741.4%
 
Other values (244)489418.1%
 
ValueCountFrequency (%) 
01409152.2%
 
119037.1%
 
214365.3%
 
310744.0%
 
49193.4%
 
ValueCountFrequency (%) 
5091< 0.1%
 
4511< 0.1%
 
3841< 0.1%
 
3811< 0.1%
 
3801< 0.1%
 

Pax fatalities
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count181
Unique (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.3811413103934145
Minimum0
Maximum506
Zeros20401
Zeros (%)75.6%
Memory size210.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile17
Maximum506
Range506
Interquartile range (IQR)0

Descriptive statistics

Standard deviation14.72485116
Coefficient of variation (CV)4.354994309
Kurtosis165.6564741
Mean3.38114131
Median Absolute Deviation (MAD)0
Skewness10.36162094
Sum91186
Variance216.8212418
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
02040175.6%
 
112614.7%
 
28223.0%
 
35752.1%
 
45111.9%
 
53741.4%
 
62721.0%
 
72340.9%
 
82150.8%
 
91710.6%
 
Other values (171)21337.9%
 
ValueCountFrequency (%) 
02040175.6%
 
112614.7%
 
28223.0%
 
35752.1%
 
45111.9%
 
ValueCountFrequency (%) 
5061< 0.1%
 
3341< 0.1%
 
3261< 0.1%
 
3071< 0.1%
 
2891< 0.1%
 

Other fatalities
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct count40
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.09781601097556454
Minimum0
Maximum237
Zeros26634
Zeros (%)98.8%
Memory size210.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum237
Range237
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2.412807292
Coefficient of variation (CV)24.66679297
Kurtosis4989.150825
Mean0.09781601098
Median Absolute Deviation (MAD)0
Skewness61.95186755
Sum2638
Variance5.82163903
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
02663498.8%
 
11170.4%
 
2480.2%
 
3370.1%
 
4260.1%
 
5150.1%
 
613< 0.1%
 
710< 0.1%
 
108< 0.1%
 
87< 0.1%
 
Other values (30)540.2%
 
ValueCountFrequency (%) 
02663498.8%
 
11170.4%
 
2480.2%
 
3370.1%
 
4260.1%
 
ValueCountFrequency (%) 
2371< 0.1%
 
1801< 0.1%
 
1101< 0.1%
 
1071< 0.1%
 
711< 0.1%
 

Total fatalities
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count201
Unique (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.806703993473989
Minimum0
Maximum520
Zeros11108
Zeros (%)41.2%
Memory size210.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q35
95-th percentile24
Maximum520
Range520
Interquartile range (IQR)5

Descriptive statistics

Standard deviation17.10951531
Coefficient of variation (CV)2.946510677
Kurtosis117.8017019
Mean5.806703993
Median Absolute Deviation (MAD)1
Skewness8.755873054
Sum156601
Variance292.7355142
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
01110841.2%
 
126589.9%
 
224589.1%
 
317756.6%
 
415665.8%
 
512634.7%
 
69263.4%
 
77462.8%
 
85782.1%
 
94391.6%
 
Other values (191)345212.8%
 
ValueCountFrequency (%) 
01110841.2%
 
126589.9%
 
224589.1%
 
317756.6%
 
415665.8%
 
ValueCountFrequency (%) 
5201< 0.1%
 
3461< 0.1%
 
3351< 0.1%
 
3291< 0.1%
 
3121< 0.1%
 

Circumstances
Categorical

HIGH CARDINALITY

Distinct count21207
Unique (%)78.7%
Missing31
Missing (%)0.1%
Memory size210.7 KiB
Engine failure.
 
546
Crashed in unknown circumstances.
 
214
Shot down by enemy fire.
 
187
Crash on a mountain 8 minutes after takeoff from Nausori Airport. All 17 occupants were killed.
 
180
Crashed in unknown circumstances in the Gulf of Mexico while completing a training mission. All three crew members, two pilots and an instructor, were killed. Crew: Cpt John Krafft, 1st Lt Ronald Pahl, Ltjg Robert Roch.
 
157
Other values (21202)
25654
ValueCountFrequency (%) 
Engine failure.5462.0%
 
Crashed in unknown circumstances.2140.8%
 
Shot down by enemy fire.1870.7%
 
Crash on a mountain 8 minutes after takeoff from Nausori Airport. All 17 occupants were killed.1800.7%
 
Crashed in unknown circumstances in the Gulf of Mexico while completing a training mission. All three crew members, two pilots and an instructor, were killed. Crew: Cpt John Krafft, 1st Lt Ronald Pahl, Ltjg Robert Roch.1570.6%
 
Fuel exhaustion.1470.5%
 
While flying off Florida coast, the pilot informed the controllers that he was low fuel and he ditched the aircraft 300 meters off shore. The board said that the fuel quantity on board was insufficient for the trip.1230.5%
 
Controlled flight into terrain.1200.4%
 
Shot down by the German Flak.1160.4%
 
It appears that the thrust reverser on left engine was engaged by mistake on approach. This was caused by a mechanical failure.1130.4%
 
Other values (21197)2503592.8%
 

Length

Max length12209
Median length197
Mean length278.6239386
Min length3

FlightDate
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count17590
Unique (%)65.2%
Missing0
Missing (%)0.0%
Memory size210.7 KiB
1944-06-06
 
23
1943-05-17
 
17
1939-12-18
 
16
1945-03-24
 
16
1944-06-13
 
14
Other values (17585)
26883
ValueCountFrequency (%) 
1944-06-06230.1%
 
1943-05-17170.1%
 
1939-12-18160.1%
 
1945-03-24160.1%
 
1944-06-13140.1%
 
1944-03-0611< 0.1%
 
1944-09-1711< 0.1%
 
1935-12-3110< 0.1%
 
1944-05-2510< 0.1%
 
1943-07-1110< 0.1%
 
Other values (17580)2683199.5%
 

Length

Max length10
Median length10
Mean length10
Min length10

FlightOperator
Categorical

HIGH CARDINALITY

Distinct count7836
Unique (%)29.1%
Missing0
Missing (%)0.0%
Memory size210.7 KiB
Royal Air Force - RAF
 
2247
United States Air Force - USAF (since 1947)
 
1443
Aeroflot - Russian International Airlines
 
1358
United States Army Air Forces - USAAF (1941-1947)
 
1350
United States Navy - USN
 
651
Other values (7831)
19920
ValueCountFrequency (%) 
Royal Air Force - RAF22478.3%
 
United States Air Force - USAF (since 1947)14435.4%
 
Aeroflot - Russian International Airlines13585.0%
 
United States Army Air Forces - USAAF (1941-1947)13505.0%
 
United States Navy - USN6512.4%
 
Private American4031.5%
 
Royal Australian Air Force - RAAF2671.0%
 
French Air Force - Armée de l'Air1910.7%
 
Royal Canadian Air Force - RCAF1750.6%
 
Brazilian Air Force - Força Aérea Brasileira1640.6%
 
Other values (7826)1872069.4%
 

Length

Max length90
Median length24
Mean length28.07504913
Min length3

PlaneAge
Real number (ℝ≥0)

ZEROS

Distinct count80
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.84953094293448
Minimum0
Maximum90
Zeros1979
Zeros (%)7.3%
Memory size210.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q12
median8
Q318
95-th percentile36
Maximum90
Range90
Interquartile range (IQR)16

Descriptive statistics

Standard deviation11.94262431
Coefficient of variation (CV)1.007856291
Kurtosis2.260631843
Mean11.84953094
Median Absolute Deviation (MAD)7
Skewness1.441895748
Sum319570
Variance142.6262755
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1285010.6%
 
220297.5%
 
019797.3%
 
314855.5%
 
413284.9%
 
512714.7%
 
611624.3%
 
710573.9%
 
89623.6%
 
98003.0%
 
Other values (70)1204644.7%
 
ValueCountFrequency (%) 
019797.3%
 
1285010.6%
 
220297.5%
 
314855.5%
 
413284.9%
 
ValueCountFrequency (%) 
901< 0.1%
 
791< 0.1%
 
782< 0.1%
 
771< 0.1%
 
754< 0.1%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

rowidsrc_rowiddtdf_indexDate & TimeType of aircraftOperatorRegistrationFlight PhaseFlight TypeSurvivorsSiteScheduleMSNYOMLocationCountryRegionCrew on boardCrew fatalitiesPax on boardPax fatalitiesOther fatalitiesTotal fatalitiesCircumstancesFlightDateFlightOperatorPlaneAge
0112020-09-20 14:38:01.1800000Jul 15, 2020 at 2245 LTBeechcraft 350 Super King AirTurkish Police - Türk Polis (45115)EM-809FlightSurvey / Patrol / ReconnaissanceNoMountainsVan - VanFL-8962015Mt Artos\n\nEastern Anatolia Region (Dogu Anadolu Bölgesi)TurkeyAsia225507The twin engine aircraft departed Van-Ferit Melen Airport at 1834LT on a survey/reconnaissance mission over the province of Hakkari and Van, carrying five passengers and two pilots. At 2232LT, the crew informed ATC about his position vertical to Baskale on approach to Van-Ferit Melen Airport. Thirteen minutes later, the aircraft struck the slope of Mt Artos located 30 km southwest of runway 03 threshold. The aircraft was destroyed upon impact and all seven occupants were killed.2020-07-15Turkish Police - Türk Polis5
1222020-09-20 14:38:01.8930000Jul 14, 2020De Havilland DHC-8-400 (Dash-8)Bluebird Aviation (35301)5Y-VVULanding (descent or approach)CargoYesAirport (less than 10 km from airport)Djibouti City – Beledweyne40082000Beledweyne-Haji-Sheikh Mahmud Hasan (Ugas Khalif)\n\nHiiraan (??????)SomaliaAfrica300000After landing at Beledweyne-Haji-Sheikh Mahmud Hasan (Ugas Khalif) Airport, the aircraft went out of control and came to rest against several earth mounds, bursting into flames. All three crew members managed to escape while the aircraft was destroyed by fire. The crew was completing a cargo flight from Djibouti City on behalf of the African Union Mission to Somalia (AMISOM) and it is believed that the aircraft was carrying food supplies.2020-07-14Bluebird Aviation20
2332020-09-20 14:38:02.5330000Jul 13, 2020 at 0410 LTPZL-Mielec AN-2Zeus-Avia (45102)RA-40851FlightSpraying (Agricultural)NoPlain, ValleyDjibouti City – Beledweyne1G174-471977Kistenevo\n\nNizhny Novgorod oblastRussiaAsia220002The crew was completing a spraying mission near the village of Kistenovo (about 160 km southeast of Nizhny Novgorod). In unclear circumstances, it appears that the aircraft struck high tension wires then crashed in a field, bursting into flames. The captain was seriously injured while the copilot was killed. Two days later, the captain died from his injuries.2020-07-13Zeus-Avia43
3442020-09-20 14:38:03.1970000Jun 15, 2020Gulfstream GIIPrivate American (31833)N27SLLanding (descent or approach)Illegal (smuggling)YesPlain, ValleyDjibouti City – Beledweyne841970Machiques\n\nZuliaVenezuelaSouth America210001The crew was engaged in an illegal trip and elected to land on a remote 'airstrip' located in the region of Machiques. The aircraft crash landed and came to rest, bursting into flames. One pilot was killed and the second was injured.2020-06-15Private American50
4552020-09-20 14:38:03.8230000Jun 14, 2020Embraer EMB-121 XinguOeste Veículos (44860)PT-MBVTakeoff (climb)PrivateNoAirport (less than 10 km from airport)Tangará da Serra – Goiânia121-0531982Tangará da Serra\n\nMato GrossoBrazilSouth America220002Shortly after takeoff, while in initial climb, the twin engine aircraft went out of control, lost altitude and crashed in a cornfield. The aircraft disintegrated on impact and both pilots were killed.2020-06-14Oeste Veículos38
5662020-09-20 14:38:04.4600000Jun 7, 2020 at 0415 LTMitsubishi MU-2 MarquiseMcNeely Charter Service (32666)N44MXTakeoff (climb)CargoNoAirport (less than 10 km from airport)Everett – Huron15261981Sioux Falls\n\nSouth DakotaUnited States of AmericaNorth America110001The pilot departed Everett-Payne Field in the evening of June 6 on a cargo service to Huron, SD. En route, he was informed about the presence of thunderstorms in the Huron area and decided to divert to Sioux Falls Airport where he landed at 0140LT. Awaiting weather improvement, he left Sioux Falls around 0415LT to resume his flight to Huron. Upon takeoff, the twin engine aircraft crashed in unknown circumstances and was destroyed. The pilot was killed.2020-06-07McNeely Charter Service39
6772020-09-20 14:38:05.0670000Jun 5, 2020 at 1513 LTPiper PA-31 CheyenneLarry Ray Pruitt (44800)N135VEFlightPrivateNoPlain, ValleyWilliston – New Castle31-75200241975Eatonton\n\nGeorgiaUnited States of AmericaNorth America114405The twin engine aircraft departed Willison Airport, Florida, at 1401LT bound for New Castle, Indiana, with four passengers and one pilot on board. Almost an hour into the flight, while cruising at an altitude of 25,000 feet, the aircraft entered a right turn then control was lost. While descending, the aircraft lost several pieces (wing parts) and caught fire before crashing in a wooded area located 6 miles northeast of Eatonton, bursting into flames. All five occupants were killed. They were on their way to funeral in Indiana.2020-06-05Larry Ray Pruitt45
7882020-09-20 14:38:05.6700000May 28, 2020 at 1543 LTRockwell Shrike Commander 500SAlaska Department of Natural Resources (Division of Forestry) (41149)N909AKTakeoff (climb)Survey / Patrol / ReconnaissanceYesLake, Sea, Ocean, RiverAniak - Aniak500-32321975Aniak\n\nAlaskaUnited States of AmericaNorth America103000After takeoff from Aniak Airport runway 10, the pilot encountered engine problems. Control was lost and the aircraft crashed in the Aniak River east of the airport. All four occupants were quickly rescued and medevaced to Anchorage. The aircraft was destroyed.2020-05-28Alaska Department of Natural Resources (Division of Forestry)45
8992020-09-20 14:38:06.3130000May 22, 2020 at 1439 LTAirbus A320Pakistan International Airlines - PIA (31061)AP-BLDLanding (descent or approach)Scheduled Revenue FlightYesCityLahore - Karachi22742004Karachi-Muhammad Ali Jinnah-Quaid-e-Azam\n\nSindh (??? ????)PakistanAsia889189198On 22 May 2020 at 13:05 hrs PST, the Pakistan International Airlines aircraft Airbus A320-214, registration number AP-BLD, took off from Lahore (Allama Iqbal International Airport – AIIAP) Pakistan to perform a regular commercial passenger flight (PK8303) to Karachi (Jinnah International Airport – JIAP) Pakistan, with 8 crew members (01 Captain, 01 First Officer, and 06 flight attendants) and 91 passengers on board. At 14:35 hrs the aircraft performed an ILS approach for runway 25L and touched down without landing gears, resting on the engines. Both engines scrubbed the runway at high speed. Flight crew initiated a go-around and informed “Karachi Approach” that they intend to make a second approach. About four minutes later, during downwind leg, at an altitude of around 2000 ft, flight crew declared an emergency and stated that both engines had failed. The aircraft started losing altitude. It crashed in a populated area, short of runway 25L by about 1340 meters. An immediate subsequent post impact fire initiated. Out of 99 souls on-board, 97 were fatally injured and 02 passengers survived. On ground 04 persons were injured however 01 out of these reportedly expired later at a hospital.\n\r\nBelow, the preliminary report published by the Pakistan AAIB.2020-05-22Pakistan International Airlines - PIA16
910102020-09-20 14:38:06.9200000May 12, 2020Quest Kodiak 100Missionary Aviation Fellowship - MAF (31410)PK-MECTakeoff (climb)CargoNoLake, Sea, Ocean, RiverJayapura - Mamit100-00262009Jayapura-Sentani\n\nSpecial Region of PapuaIndonesiaAsia110001Shortly after takeoff from Jayapura-Sentani Airport, while climbing, the pilot sent a brief mayday message when he lost control of the airplane that crashed in Sentani Lake, two minutes after takeoff. The wreckage was found at a depth of 13 meters and the pilot, sole on board, was killed.2020-05-12Missionary Aviation Fellowship - MAF11

Last rows

rowidsrc_rowiddtdf_indexDate & TimeType of aircraftOperatorRegistrationFlight PhaseFlight TypeSurvivorsSiteScheduleMSNYOMLocationCountryRegionCrew on boardCrew fatalitiesPax on boardPax fatalitiesOther fatalitiesTotal fatalitiesCircumstancesFlightDateFlightOperatorPlaneAge
2695926961269612020-09-20 23:43:40.3300000Dec 8, 1918 at 1200 LTDe Havilland DH.4United States Signal Corps - USSC (1914-1919) (39326)97FlightMilitaryNoPlain, ValleySeaton Carew AFB - Seaton Carew AFBB99831918Brookeville\n\nMarylandUnited States of AmericaNorth America100000Pilot was performing a mail flight. He was killed in the accident which occurred in unknown circumstances.1918-12-08United States Signal Corps - USSC (1914-1919)0
2696026962269622020-09-20 23:43:41.2230000Dec 2, 1918 at 1200 LTDe Havilland DH.4United States Signal Corps - USSC (1914-1919) (39326)AS-22846FlightMilitaryNoAirport (less than 10 km from airport)Seaton Carew AFB - Seaton Carew AFBB99831918Wright-Patterson AFB (Dayton)\n\nOhioUnited States of AmericaNorth America000000The accident occurred in unknown circumstances.1918-12-02United States Signal Corps - USSC (1914-1919)0
2696126963269632020-09-20 23:43:42.2230000Nov 20, 1918 at 1200 LTDe Havilland DH.4United States Signal Corps - USSC (1914-1919) (39326)AS-39406FlightMilitaryNoPlain, ValleySeaton Carew AFB - Seaton Carew AFBB99831918New Jersey\n\nNew JerseyUnited States of AmericaNorth America000000The accident occurred in unknown circumstances.1918-11-20United States Signal Corps - USSC (1914-1919)0
2696226964269642020-09-20 23:43:43.4200000Nov 12, 1918 at 1200 LTDe Havilland DH.4United States Signal Corps - USSC (1914-1919) (39326)SC-39168FlightMilitaryNoAirport (less than 10 km from airport)Seaton Carew AFB - Seaton Carew AFBB99831918Fort Worth\n\nTexasUnited States of AmericaNorth America000000The aircraft crashed in unknow circumstances near Everman-Barron Field airport.1918-11-12United States Signal Corps - USSC (1914-1919)0
2696326965269652020-09-20 23:43:44.4670000Nov 12, 1918 at 1200 LTDe Havilland DH.4United States Signal Corps - USSC (1914-1919) (39326)SC-39168FlightMilitaryNoAirport (less than 10 km from airport)Seaton Carew AFB - Seaton Carew AFBB99831918Detroit\n\nMichiganUnited States of AmericaNorth America000000The aircraft crashed in unknow circumstances near Everman-Barron Field airport.1918-11-12United States Signal Corps - USSC (1914-1919)0
2696426966269662020-09-20 23:43:45.5900000Nov 9, 1918 at 1200 LTDe Havilland DH.4United States Signal Corps - USSC (1914-1919) (39326)AS-23130FlightMilitaryNoAirport (less than 10 km from airport)Seaton Carew AFB - Seaton Carew AFBB99831918Langley AFB (Hampton)\n\nVirginiaUnited States of AmericaNorth America000000The aircraft crashed in unknown circumstances.1918-11-09United States Signal Corps - USSC (1914-1919)0
2696526967269672020-09-20 23:43:46.6670000Sep 13, 1918 at 1200 LTDe Havilland DH.9Royal Air Force - RAF (31257)D1745FlightMilitaryYesPlain, ValleySeaton Carew AFB - Seaton Carew AFBB99831918United Kingdom\n\nAll United KingdomUnited KingdomEurope100000The pilot tried to return to his base but due to low visibility by night, he lost his orientation. He elected to make an emergency landing in an open field but the aircraft hit a tree and crashed. The pilot was injured.1918-09-13Royal Air Force - RAF0
2696626968269682020-09-20 23:43:47.8130000Aug 26, 1918Blackburn RT.1 KangarooRoyal Air Force - RAF (31257)B9976Landing (descent or approach)MilitaryYesAirport (less than 10 km from airport)Seaton Carew AFB - Seaton Carew AFBB99761918Seaton Carew AFB\n\nDurhamUnited KingdomEurope200000On final approach in bad visibility, aircraft was too low. It struck the ground short of runway and crashed. Both occupants were injured. Crew was performing a training flight on behalf of the 246th Squadron.1918-08-26Royal Air Force - RAF0
2696726969269692020-09-20 23:43:48.7870000Jun 24, 1918 at 1200 LTBreguet 14French Air Force - Armée de l'Air (31256)AS-4130Landing (descent or approach)MilitaryYesAirport (less than 10 km from airport)Seaton Carew AFB - Seaton Carew AFBB99761918France\n\nAll FranceFranceEurope200000The aircraft crashed in unknown circumstances.1918-06-24French Air Force - Armée de l'Air0
2696826970269702020-09-20 23:43:49.8000000Jun 19, 1918De Havilland DH.4United States Signal Corps - USSC (1914-1919) (39326)AS-32098FlightMilitaryNoAirport (less than 10 km from airport)Wright Patterson AFB-Wright Patterson AFBB99761918Wright-Patterson AFB (Dayton)\n\nOhioUnited States of AmericaNorth America110001Lt. Frank Stuart Patterson, son and nephew of the co-founders of National Cash Register, is killed in the crash of his DH.4M, AS-32098, at Wilbur Wright Field during a flight test of a new mechanism for synchronizing machine gun and propeller, when a tie rod breaks during a dive from 15,000 feet (4,600 m), causing the wings to separate from the aircraft. Wishing to recognize the contributions of the Patterson family (owners of NCR) the area of Wright Field east of Huffman Dam (including Wilbur Wright Field, Fairfield Air Depot, and the Huffman Prairie) is renamed Patterson Field on 6 July 1931, in honor of Lt. Patterson.1918-06-19United States Signal Corps - USSC (1914-1919)0